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We present different classes of solutions to the Firing Squad Synchronization Problem 
on networks of different shapes. The nodes are finite state processors that work at unison 
discrete steps. The networks considered are the line, the ring and the square. For all of these 
models we have considered one and two-way communication modes and also constrained the 
CN) | quantity of information that adjacent processors can exchange each step. We are given a 

particular time expressed as a function of the number of nodes of the network, f(n) and 
present synchronization algorithms in time n 2 , nlogn, n^Jn, 2 n . The solutions are presented 
■ as signals that are used as building blocks to compose new solutions for all times expressed 

^"*^ | by polynomials with nonnegative coefficients. 

1 Introduction 

The famous firing squad synchronization problem (FSSP), is an old problem posed by Myhill in 
■^j- \ 1957 (in print in 18 ). In terms of Cellular Automata, we are given a line of n identical cells 

(finite state machines) that work synchronously at discrete time steps, initially a distinguished 
cell (the so called general) starts computing while all others are in a quiescent state; at each 
time step any cell sends/receives to/from its neighbours some information about their state at 
the preceding time: the problem is to let all cells in the line enter the same state, called firing, 
for the first time and at the very same instant, the firing time. 
q ! In literature many solutions to the original problem and to some variations of it have been 

given. The early results all focused on the synchronization in minimal time: Minsky in |17j 
showed that a solution to the FSSP requires at least 2n — 1 time, Waksman |22| and Balzer 
gave the first solution in this minimal time and Mazoyer in 14 constructed a minimal time 
solution with the least number of states to date: six. In it has also been shown that five 
states are always necessary for a solution. 

A significant amount of papers have also dealt with some variations of the FSSP. These 
variations concerned both the geometry of the network and some computational constraints. In 
the following we briefly recall some of them. The FSSP has been studied on a (one-way) ring 
of n processors jU^^' on arravs of two and three-dimensions [211 18]: in all these papers all the 
results focused on lower and upper bounds on the minimal time for the synchronization. In the 
very recent paper the cells of the network are placed along a path in the two-dimensional 
array space, there a combinatorial problem (for which only exponential algorithms are known) 
is reduced to the existence of an optimal solution to the FSSP on this path. In [2U] solutions for 
the Cayley graphs are given and in ^2] a particular class of graphs is studied and for this class a 
solution in time 3r + 1 or 3r is given, where r is the longest distance between the general and any 
other node (the radius) of the graph. Some constrained variants of the FSSP have concerned 
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solutions on the interesting model of reversible CA (i.e., backward deterministic CA) [Oj and 
CA with a number-conserving property (i.e., a state is a tuple of positive integers whose sum 
is constant during the computation) [2j . Other kinds of constraints which have been considered 
concern the amount of information exchanged between any pair of adjacent cells. In |151 I12| 
the network is a line of cells that can exchange only one bit, that is at each time step each cell 
sends/receives only one bit of information to/from the adjacent cells instead of its whole state. 
Finally let us recall the significant work of [2] where the FSSP is studied in a distributed setting 
(no global clock, but lock-step synchrony) with bounds on the number of faulty processors. 

In this paper we consider the problem on various networks (line, ring, square), and for one 
and two-way communication modes, but with a new approach with respect to the past: we 
hypothesize we are given the firing time and we ask for a synchronization algorithm in this time. 
This is an interesting and challenging theoretical problem, which is also directly connected to the 
sequential composition of cellular automata. Given two cellular automata A and B computing 
respectively the functions / and g, the sequential composition of A followed by B is the cellular 
automaton obtained in the following way: first A starts on a standard initial configuration and 
when it has done with its computation, B starts using the final configuration of A as initial 
configuration. The resulting automaton clearly computes g o f. In order to compose the two 
automata it is necessary to synchronize all the cells that will be used by B at the time A 
computes /. 

Some of the results presented here are a revisiting and a generalization of some results of 
I12[ILS|. anyway here we present a whole framework of signals that, informally speaking, is a set 
of cells that at a given time receive or send a particular state. We then define some basic signals 
(building blocks) and give some rules to combine them to obtain other new signals. This modular 
approach allows to design synchronizing algorithms in a very natural way also simplifying their 
understanding and descriptions. Moreover here we introduce also as a parameter the number 
of bits that can be simultaneously transmitted at each step. We study networks where at each 
step a cell can transmits to each of its neighbours at most c bits, c > 1. 

As said above the communication between adjacent cells can be in both directions or only 
in a direction. We thus consider either networks where a cell can exchange information with all 
its neighbours, or networks where for each cell, only a predetermined half of its neighbours can 
send information to it while the other half can only receive information from it (the information 
flow is unidirectional). In this second case, to guarantee the communication from a cell to all 
the others, we consider circularly shaped networks. 

For all the considered networks we prove a lower bound on the time of a synchronization, 
then we prove its tightness by giving a matching synchronization. We obtain families of solutions 
to the considered variants of the FSSP in several times t(n), where n is the number of nodes 
of the network. The approach we follow is compositional: we first describe basic synchronizing 
algorithms and then we give general rules to compose synchronizations. The basic synchroniza- 
tions in turn are obtained by composing elementary signals, which can be seen as fragments 
of cellular automata. A synchronization is thus a special signal obtained as a composition of 
many simpler signals. Compositional rules for both signals and synchronizations include parallel 
composition, sequential composition, and iterated composition. We also state some sufficient 
conditions to apply them. In the parallel composition we start many synchronizations or sig- 
nals, all at the same time. In some cases, this composition can be used to select among different 
synchronizations depending on the number of cells in the network. Sequential composition ap- 
pends a synchronization (or a signal) to the end of another signal, possibly with a constant time 
offset. This way we are able to construct a synchronization in time t\{n) + t2(n) + d, for d > 0, 
if there exist synchronizations in time t\{n) and t2(n). If we are given two synchronizations 
respectively in time t\{n) and t2(n), the iterated composition consists of iterating t2(n) times 
the synchronization in time t\(n), thus obtaining a new synchronization in time ti(n) ■ t2(n). 
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Compositions of synchronizations are used to determine synchronizations in a "feasible" time 
expressed by any polynomial with nonnegative coefficients. Finally, we give a construction to 
"inherit" synchronizations on two-dimensional networks starting from synchronizations of the 
corresponding linear networks. We show that an (n x n) array of cells can be seen as many 
lines of (2n — 1) cells (each of them having as endpoints cells (0,0) and (n — l,n — 1)) and 
a given synchronization on a line can be executed simultaneously on all these lines. Thus we 
can synchronize an (n x n) array in time t{2n — 1), provided that there exists a synchronizing 
algorithm for a line of k cells in time t(k). 

As building blocks for the compositional rules we give synchronizing algorithms in some 
common functions: ra 2 , n[~logn], nfi/ril and 2 n . To synchronize a line of n cells in time t(n) 
we first design some basic signals and then we compose them to obtain an overall signal that 
starts from the leftmost cell and comes back to it in exactly (t(n) — 2n + 1) time units; then 
a minimal time synchronization starts, synchronizing the n cells in time t(n). To obtain a 
synchronization in time t[n) of an array of (n x n) cells we use the following approach: first 
synchronize a row in time ti(n) then start a synchronization in time t2(n) on all the columns 
such that t(n) = ti(n) + t2(n). 

It is worth noticing that the composition rules also apply to the general case of (m x n) 
arrays. Thus all the synchronizations given for an (n x n) array can be extended to an (m x n) 
array, considering the time of the synchronization as a function of either morn. 

The remainder of this paper is organized as follows. In section [2] we give the definitions 
and introduce the notation we will use throughout the rest of the paper. In section we give 
tight lower bounds on the time synchronization of c-CA and solutions in minimal time. In 
section0]the framework of the signals is presented formally. In section some composition rules 
on synchronizations are defined. In sections H3 and [7| solutions in the given times n 2 , n[~logn], 
nf-y/n] and 2 n are given for the two-way and one-way communication models, respectively. As 
an application of the compositional rules to obtain new synchronizations, in section |H] we show 
how to obtain polynomial-time synchronizations on all the considered models. The conclusions 
are in sectional 

2 Preliminaries 

In this section we give the basic definitions, introduce the models, which are generalizations of 
the well known model of cellular automata, and define our synchronization problem. 

The models. A cellular automaton is an array of pair-wise connected finite-state machines, 
called cells (or sometimes processors), which operate synchronously at discrete time steps. We 
consider both one-dimensional and two-dimensional cellular automata. The connections be- 
tween cells may be either one-way or two-way links. We consider a generalization of the known 
cellular automata since in our models the capacity of the channels, and then the communication 
complexity, may vary. We call a c-link a channel being able to transfer c bits simultaneously. 
All the cells are indistinguishable, anyway for descriptive reasons, in a one dimensional array of 
n cells we will number them starting from 0; moreover cell and cell n — 1 are said boundary 
cells. Unless stated otherwise, in the following n is the number of cells of the one-dimensional 
cellular automaton. 

The behaviour of each cell is in accordance to finite state transition functions depending on 
both the state of the cell and the output given at the preceding step by some of the connected 
cells. We define a function N : {0, . . . , n — 1} — ► {0, . . . , n — 1}* which determines the neigh- 
bouring cells on which the transition function of a given cell depends. This function depends on 
whether the connections are one-way or two-way-links and may also vary for different cells (for 
example, in the case of the boundary cells). For a cellular automaton A, we denote by rriA the 
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Figure 1: The one-dimensional and two-dimensional cellular automata. 



maximum length of N(i), for < i < n — 1. 

A c-Line is a one dimensional cellular automaton where the connections are two-way c-links 
and where the i-th cell is connected to the (i— l)-th and (i + l)-th cells, for < i < n— 1, the first 
cell is connected only to the second cell, and the last cell is connected only to the (n — 2)-th cell, 
thus N(i) = + for < i < n-1, N(0) = (1) and JV(n-l) = (n-2) (see Figured a). 

A c-Ring is a one dimensional cellular automaton with two-way c-links with a connection also 
between the first and the last cell (see Figure U\c). Thus it has a circular shape and the length of 
N(i) is two, for every i. Finally, a c-ORing is a one dimensional cellular automaton with one-way 
c-links such that a connection exists also between the first and the last cells. The c-ORing has 
also a circular shape with N(i) = (i — 1), for every i > 0, and N(0) = (n — 1). Thus tua = 1: 
the i-th. cell receives only the output of the (i — l)-th cell. 

The two-dimensional case is a natural generalization of the already considered models. In a 
two-dimensional array of n x n cells, the cells are numbered starting from (0,0). In what 

follows n x n is always the number of cells of the two-dimensional cellular automaton. Each 
cell (i,j), except for the boundary cells, is connected to cells (i — l,j), (£, j — 1), (i + and 
(i,j + 1). In this case, if the connections are two-way links, then N(i,i) = ({i — — 
1), (i + + 1)) and, with one-way connections, N(i,j) = ((i — — 1)). 

We consider a c-Square, where the connections are two-way c-links and each boundary cell is 
connected only to the neighbouring cells (see Figure ^b). For example, in this network the cell 
(0,0) is connected to the cells (0, 1) and (1,0), while a cell (i,0) is connected to the cells (i, 1) 
and (i — 1,0) and (i + 1,0). On the other hand, we can define the c-Square of Rings, where, 
similarly to the first and last cells in the c-Ring, the boundary cells are pair-wise connected, and 
c-Square of ORings where the connections are one-way c-links (see Figure ^d). 

For simplicity, we do not consider the rectangular models, that is those obtained from arrays 
of m x n cells. Many of the results in this paper can be extended to this case. Figure 121 
summarizes the considered models with respect to both the paradigms non-circular vs. circular 
and two-way vs. one-way links. Observe that we do not consider the non-circular models with 
one-way links. These models are not meaningful in this context. 

To define the behaviour of all the introduced models, we use the symbol Q referring to the 
set of states of a given cellular automaton A. Different transition functions are defined for 
different communication complexities. If we consider c-links then for the non-boundary cells 



the transition function is 5 : D C A — * D C A , where D C A is the set of tuples (q,s\, 
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Figure 2: Models of cellular automata. 



q £ Q and Sj 6 {0, 1} C . In the non-circular models we should also define transition functions for 
the boundary cells (recall these cells are connected to less adjacent cells). We omit the formal 
definitions of these functions since they are quite standard and can be easily obtained by the 
definition for the non-boundary cells. The behaviour of a cell i can be described as follows. Let 
S(q, r±, ■ ■ ■ , r m ) = (p, s\, ■ ■ ■ , s m ), if a cell i is in the state q and receives r±, ■ ■ ■ ,r m from the cells 
in N(i), then it enters the state p and sends the words s±, ■ ■ ■ , s m . (Note that this definition 
is symmetric: the number of words that each cell sends coincides with the number of received 
words.) 

Note that in the standard definition of cellular automaton each cell can send to its neigh- 
bouring cells just its state. Therefore, in this paper, whenever we consider a model with link 
capacity c such that c > [log |Q|], we will omit the index c (that is, we will just speak about a 
Line, Square, etc., instead of a c-Line, c-Square, etc.). Some of the results given in this paper 
hold for all the models, thus we will speak about a c — CA to mean any of the models above 
with link capacity c. 

A configuration of a one dimensional cellular automaton with c-links is a mapping C : 
{0, . . . ,n — 1} — > D C A . At time t, a configuration gives, for each cell i, the state entered and 
the words of bits sent at this time. A starting configuration is a configuration at time 1. In the 
following we often write "(A,C) n to denote a cellular automaton A starting on a configuration 
C. We consider the time-unrolling of A, that is a time -space array. A pair (i, t) in this array, 
with < i < n and t > 1, is called a site, and denotes the cell i at time t. The state of the cell 
i at time t is denoted by state(i,t) and the words of bits sent to the neighbours are denoted by 
left(i,t) and right(i,t). Sometimes, to avoid ambiguities, we will use stateA(i,t), left A (i,t) and 
right A (i,t) to denote the state or the words of bits sent by a cell at time t in a fixed cellular 
automaton A. A site (i, t) is said to be active if either it changes its states at the next step, or 
sends/receives a words different from 0, that is when one of the following conditions holds: 

• state(i, t) / state(i, t + 1), 

• either left(i, t) ^ or right(i, t) / 0, 

• there is i! 6 N(i) such that either left(i f , t — 1) ^ or right(i f , t — 1) ^ 0. 

In the two-dimensional configuration is defined in a natural way and the time- 

unrolling consists of triple (i,j,t), with < i,j < n and t > 1, denoting the cell (i,j) at time t. 
The state of the cell at time t is denoted by state(i, j,t). 

The problem. Here we introduce a synchronization problem which generalizes the so called 
Firing Squad Synchronization Problem (FSSP). Among the states of the considered cellular 
automaton, there are three distinguished states: G the General state, L the Latent state, and 
F the Firing state. The state L, also said quiescent as well, has the property that if a cell in 
state L receives all words from its neighbours it remains in the same state and sends the word 
to its neighbours. A standard configuration is a configuration where the cell (respectively 
cell (0, 0) in the two-dimensional case) is in state G and sends a word different from to each 
neighbour and all the other cells are in state L and send the word 0. 

A synchronization in time t(n) is a cellular automaton such that, starting from a standard 
configuration, all cells enter state F at time t{n) for the first time. We will speak about a 
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synchronization of a c-Line, c-Square, etc. when the cellular automaton is a c-Line, a c-Square, 
etc. Moreover, a cellular automaton which provides a synchronization in time tin) is also called 
a solution in time t(n) of the FSSP, or simply a solution. 

We introduce now two variations of the problem whose solutions are sometimes useful to 
synchronize CA. A Two-End synchronization in time t(n) is a Line such that at time t(n) 
all cells enter for the first time the state F, starting from a configuration which differs from 
the standard one because both the cell and the cell n — 1 are in the state G. A Four-End 
synchronization in time t(n) is a Square such that at time t(n) all cells enter for the first time the 
state F, starting from a configuration having the cells (0, 0), (0, n — 1), [n — 1, 0), (n — 1, n — 1) 
in the state G and the other cells in the Latent state. 

It is simple to see that the synchronizations of cellular automata with different communica- 
tion complexity are not unrelated problems. Actually, a synchronization of a c-CA can be seen 
as a synchronization of a c'-CA for every d > c. In particular we will often use the following 
propositions: 

Proposition 1 If there is a synchronization of a 1 — CA in time tin), then there exists a 
synchronization of a c — CA in time tin), for any c > 1. 

Note that in literature the time taken by a synchronization is sometimes expressed in terms 
of the number of steps, see for example [UIB], an< ^ sometimes with the number of successive 
configurations, see for example |14| I12j. In this paper the time is expressed by the number of 
configurations. 

3 Minimal Time Solutions 

In this section we give tight lower bounds on the time of synchronizations of c — CA and present 
the algorithms for the synchronization in minimal time. 

3.1 Lower Bounds on the Time of the Synchronizations 

A synchronization of a c-Line requires at least time 2n — 1 . Intuitively, this is the minimal time 
for the first cell to wake up all the other cells and to get back the message that all the cells have 
been awakened. Recall that in a starting configuration each cell, except the first, is in a Latent 
state and the cell i can leave the Latent state not before than time i + 1. Thus all the cells are 
awake at time n, and the first cell gets this information back at time 2n — 1. 

As regards the two-dimensional cellular automaton, Shinahr j^J has shown that the mini- 
mum time for synchronizing a rectangular array ofmxn cells is n + m + max(n, m) — 2, but this 
time reduces to 2n — 1 in the case of a Square. The following lemma summarizes these results. 

Lemma 1 Every synchronization of a c-Line or a c-Square has time greater than or equal to 
2n - 1. 

The minimum time to synchronize a Ring or Square of Rings is at least, as above, the time 
required by the first cell to send a message to all the other cells and to get the information back. 

Lemma 2 Every synchronization of a c-Ring or a c-Square of Rings has time greater than or 
equal to n + 1. 

In the next Lemma we show that time 2n is necessary to synchronize a c-ORing and in 
Lemma |U we show that the minimal time is 3n — 1 for a c-Square of ORings. 

Lemma 3 Every synchronization of a c-ORing has time greater than or equal to 2n. 
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Proof : Assume by contradiction that there exists a synchronization within time t(n) < 2n of 
a ORing (say A) and let B be an ORing which differs from A just for the size: B has 2n cells 
instead of n. Since for all t < n, states (n — l,t) = L and stateB(2n — l,t) = L, then i(n) > n 
and state a(i, t) = states {h t) f° r all < i < n — 1 and 1 < t < n. Observe that the state of the 
cell n — 1 at time n + t, for < t < re, depends on the states at time n of the following cells: 
the cells re — 1 and re — 2, when t = 1, the states of the cells n — 1, n — 2 and n — 3, when t = 2, 
and in General on the states of the cells n — 1, . . . ,n — t — 1 for 2 < t < n. As a consequence, 
stated (re — l,t) = statesin — 1, t) for 1 < t < 2ra. If i(ra) < 2n, then at time i(n) the cell n — 1 
of both A and -B will enter the state .F. Anyway the cell 2n — 1 of B at time i{n) is still in the 
state L, thus we have a contradiction. ■ 

Lemma 4 Every synchronization oj c- Square of ORings has time greater than or equal to 3n— 1. 

Proof : Assume by contradiction that there exists a synchronization A in time i(n) < 3n — 1 of 
a Square of ORings and let and B be a Square of ORings which differs from A for the number of 
cells: 2n x 2n instead of n x n. Since for all t < n and < i < n— 1, states (i, n — 1, t) = statcAin — 
l,i,t) = L and states{i,2n — l,t) = states{2n — l,i,t) = L, then stateA^i, j,t) = states(i, j,t) 
for all < i,j < n — 1 and 1 < t < n. Furthermore, for both A and B the state of cell at 
time n is L for all cells (z, j) such that £ + j > n — 1. The state of the cell (n — 1, n — 1) at time 
n + 1, for < t < i{n) — n, depends on the states at time n of the cells (n — 1 — u, n — 1 — v), 
for u + v < t. As a consequence, at time t(n) the cell (n — 1, n — 1) of both A and S will enter 
the state F. Anyway since the cell (2n — 1, 2n — 1) of B at time t(n) is still in the state L, we 
have a contradiction. ■ 



3.2 Synchronization in Minimal Time for Two-way Communication Net- 
works 

In this subsection we present the minimal time algorithms for the synchronization of the models 
whose connections are two-way links. The Proposition ^ allows us to prove the statements only 
for the case c = 1. 

Waksman in [22^ gave the first solution to the problem of synchronizing a Line in the minimal 
time 2n — 1, and Mazoyer, in ^3] showed that a minimal time synchronization exists for a 1-Line. 
Moreover, Shinahr j^J has shown the minimal time solution for a Square. In ^Uj) the approach 
by Shinahr is combined with the solution by Mazoyer to obtain a minimal time synchronization 
of a 1-Square. 

Lemma 5 For every link capacity c > 1, there is a synchronization of a c-Line and of a c- Square 
in time 2n — 1. 

The above synchronizations can be used to obtain a Two-End synchronization of a Line and 
a Four-End synchronization of a Square in time n as shown in the following lemma. 

Lemma 6 There are a Two-End synchronization of a Line in time n and a Four-End synchro- 
nization of a Square in time n. 

Proof : The Two-End synchronization in time n can be obtained by considering a line as split 
in two halves and synchronizing each of them separately by a minimal time solution. This can 
be implemented by just starting a minimal time solution from both ends. In fact, each cell can 
determine its membership to a sub-line at the time it moves from the Latent state: this happens 
by a communication received from its left neighbour (membership to the left half-line), or from 
its right neighbour (membership to the right half-line). Note that, in case n is odd, the central 
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Figure 3: The frames in a Square ofnxn processors. 



cell belongs to both half-lines, while when n is even, the central cells start acting as the last 
cells of their half-lines with 1 time unit of delay (at the time they receive a communication from 
the other half- line). Therefore, in both cases the Line is synchronized in time n. 

Consider now a Square. We rearrange it in n concentric frames, where the (i + l)-th inner 
frame is constituted by the four lines (i, i) . . . (i, n — i — 1), (i, n — i — 1) . . . (to — i — 1, to — i — 1), 
. . . (n — i — 1, i) and (n — i — . . . (n — i — l,n — i — 1), see FigureEl Suppose now that 
the cells (0,0), (0, n — 1), (to — 1,0) and (n — l,n — 1) are all in the same General state. The 
four lines of the first frame can all synchronize in time n using the above result on the Two-End 
synchronization of a Line; during such synchronizations, after the first two steps, the four cells 
(1, 1), (l,n — 2), (n — 2, 1) and (to— 2, to — 2) all enter a General state and thus the four lines of the 
second frame can synchronize in time n — 2. Iterating this argument, the i-ih frame synchronizes 
in time n — 2(i — 1), 1 < % < [to/2] . As this synchronization starts at time 2(i — 1) + 1, then the 
overall time to synchronize the processors is still n. ■ 

The synchronizations sketched in the above proof do not work when the link capacity is 1. 
The main reason is that synchronizations of 1-CA critically use the parity of the time a bit 1 
is received to distinguish between different messages. In particular, each cell i expects an even 
time delay between the message sent by the General to wake up all cells and the reply sent 
by the last cell in the Line (in a minimal time solution the last cell replies as soon as it gets 
awakened). In the schema sketched in the proof of Lemma El for the Two-End synchronization 
of a Line, when to is even, the central cells delay the response of 1 time unit. Therefore, the 
reply message would be misunderstood by all the other cells, unless we delay it by another time 
unit. This is the idea exploited in the solution given in ^Hj. Therefore, we have the following 
lemma. 

Lemma 7 There are a Two-End synchronization of a Line in time 2 \n /2j + 1 and a Four-End 
synchronization of a Square in time 2[n/2\ + 1. 

Note that the minimal time synchronization of a 1-Line by Mazoyer |15j . can be modified to 
work for a 2-Line without relying on the parity of delays to recognize messages (we simply use 
the second bit to do that). Thus, it is easy to verify that the schema sketched in the proof of 
Lemma can be adapted to work for a 2-CA using techniques similar to that used in ^U] for 
the 1-CA. Therefore, the following lemma holds. 

Lemma 8 For every link capacity c > 2, there are a Two-End synchronization of a Line in 
time n and a Four-End synchronization of a Square in time n. 
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The following lemma states that the lower bounds given in the previous section for c-Ring 
and c-Square of Rings are tight for c > 2. Note that, for the Ring, a similar, but not correct 
result, can be found in [I]. 

Lemma 9 For every link capacity c> 2, there is a synchronization of a c-Ring and of a c- Square 
of Rings in time n + 1. Moreover, there is a synchronization of a 1-Ring and of a 1- Square of 
Rings in time 2\n/2~\ +1. 

Proof : A c-Ring can simulate a Two-End synchronization of a c-Line of n + 1 cells, so 
obtaining a synchronization in time n + 1. Actually, the cell can act as both the boundary 
cells of the c-Line. 

A synchronization of a c-Square of Rings in time n + 1 can be obtained by looking at this 
Square as split in three parts: the first row, the first column and the remaining of the array, 
that is a subarray of (n — 1) x (n — 1) cells. As we have just noticed, the first row and the first 
column can be synchronized in time n + 1. During these synchronizations (in the first two steps) 
the cells (1, 1), (l,n — 1), (n — 1, 1), (n — 1, n — 1) can enter a new state acting as a General 
state of a Four-End synchronization of a Square of (n — 1) x (n — 1) cells. Using Lemmas Eland 
[7| and considering that this last synchronization starts with a two step delay, we get the stated 
results. ■ 

We can give now the main results of the section. 

Theorem 1 

• For every link capacity c > 1, there is a synchronization of a c-Line and of a c-Square in 
time 2n — 1; moreover, every synchronization of a c-Line or a c-Square has time greater 
than or equal to 2n— 1. 

• For every link capacity c > 2, there is a synchronization of a c-Ring and of a c-Square of 
Rings in time n + 1, and there is a synchronization of a 1-Ring and of a 1-Square of Rings 
in time 2[~n/2] + 1; moreover, for every link capacity c > 1, every synchronization of a 
c-Ring or a c-Square of Rings has time greater than or equal to n + 1. 

We observe that there is a gap between the shown lower and upper bounds for the synchro- 
nization of a 1-Ring and a 1-Square of Rings only for when n is odd. 

3.3 Synchronization in Minimal Time for One-way Communication Networks 

The following two lemmas state that the lower bounds given in the previous section for the 
models using one-way links are tight. 

Lemma 10 There is a synchronization of a ORing in time 2n. 

Proof : Using standard techniques, a computation of a Line A of n processors in time t(n) can 
be executed by an ORing B in time 2t(n), provided that the initial configuration of A can be 
reached in one step from the initial configuration of B. We informally use an induction on the 
number of steps. Let statesii + 1, 1) = stateA(i, 1) and statesiO, 1) = stateA(n, 1) and assume 
that statesii + t, 2t) = stateA(i, t). (To be more precise, since the cell i + t of B has to simulate 
the cell i of A, then when i = or when % = n — 1 the state of the cell i + t of B encodes a 
state of A and the information that the simulated cell is the leftmost or the rightmost in the 
line). Now the cell i of A at the next step needs the states of cells i — 1 and i + 1 at the time t. 
Cell (i — 1) + 1 of B passes its own state p to the cell (i + t) and this in turn forwards p along 
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with its state to the right neighbouring cell, the cell (i + 1) + t. This last cell can simulate the 
behaviour of the cell i of A at the step t+1. Thus statesii + t + 1, 2(i + 1)) = statej^ii, t + 1). 
The overall simulation takes thus a multiplicative delay factor of two. 

Let us consider now a Two-End synchronization S of a Line. It takes time n and a synchro- 
nization of an ORing in time 2n can be obtained with the above simulation. Actually, in the 
first step it lets the second cell enter a General state, so that the state of the cell i + 1 after the 
first step is equal to the state of the cell % in the starting configuration of S. ■ 

Lemma 11 There is a synchronization of a Square of ORings in time 3n — 1. 

Proof : We will first give an easier to describe solution which takes time 3n and then we show 
how to save one time unit. 

Using standard techniques (as in the previous proof), any computation of a Square A in 
time t(n) can be executed by a Square of ORings B in time 3t{n) in the following way. We 
informally use an induction on the number of steps. Assume that the cell (i + 1, k + 1) in the 
third configuration of B contains the state that the cell (i, k) has in the first configuration of A 
and that cell (i + j,k + j) of B at the time 3j has the state that cell (i, k) of A has at the time 
j. Actually, when the cell (i, k) is a border cell, i.e. when either i E {0, n — 1} or k £ {0, n — 1}, 
also this information is stored in the state of the cell (i + j,k + j) of B. Now the cell (£, k) of 
A at the j-th step computes the new state from its own state and the states of cells (i — l,k), 
(i, k — 1), (i + 1, k) and (i, k + 1) at time j. Within three steps the cell (i + (j + l),k + (j + 1)) 
of B can collect the states that at time 3j are in the cells (i + j,k + j), ((« — 1) + j, k + j), 
(« + j, (& — 1) + j), ((i + 1) + j, k + i) and (< + j, (fc + 1) + j). Namely: 

1. at step 3j, cell (i + j,k + j) of -B stores the two states p, q of cells ((i — 1) + j, k + j) and 
(i + i, 

2. at step 3j + 1 the states p, q are passed to cells ((i + 1) + j, + j) and (i + {k + 1) + j) 
(note that in the previous step the state of cell (i + j,k + j) at time 3j has been passed to 
these cells); 

3. at step 3j + 2, cell ((i + 1) + j, (fe + 1) + jf) simulates cell (i, k) of A at step j. 

So the state of the cell (i + (j + 1), k + (j + 1)) of B at time 3j + 3 contains the state that the 
cell (i, k) of B has at time j + 1. The overall simulation takes thus a multiplicative delay factor 
of three. 

Let now A be a Four-End synchronization as in Lemma El Recall that in this automaton, 
the Square is seen as organized in concentric frames (see Figure 0J) which are synchronized 
at the same time n. We can get a Square of ORings A 1 which in the first two steps reaches 
a configuration such that the states of all the cells (0, 0), (0, 1), (1, 0) and (1,1) contain the 
General state (recall that the states of the cells (0, 0), (0, n — 1), (n — 1, 0) and (n — 1, n — 1) in 
the starting configuration of the solution S are all the General state). Then A' simulates the 
solution A within time 3n. 

Now let us briefly explain how A' can be modified to save one step, thus reaching time 
3n — 1. The first 3n — 3 steps (and thus the first 3n — 2 configurations) remain unmodified. Let 
us observe what follows: 

1. Each cell of A in the configuration j participates for the synchronization of the frame 
which it belongs to; actually each cell participates either only for a row line or only for a 
column line of the frame except for the four corner cells of the frame which participate for 
both the lines. The same holds also for A' in the configurations 3j (due to the mapping 
between the cells of the configuration j of A and those of configuration 3j of ^4'). 
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2. At time 3j + 2 in A', 1 < j < n, a cell (i + (j + 1), k + (j + 1)) is aware of the states at 
time 3j of the following cells: 

a) + + + + (i+(j + i),(A;-2) + (j+l)), (i + (j + l),(fc-l) + (j + l)) 
and (i+(j + l),k + (j + l)); 

b) ((i-l) + C7 + l), (Ar-1) + C7 + 1)), ((i-2) + (j + l),fc+(i+l)), ((i-l) + (j+l),fc+(i+l)) 
and + + + + 

Thus at step 3n — 2, the cell (i + n,k + n) can correctly simulate either cell (i, k — 1) or cell 
(i — 1, k) of £ at step n — 1, hence entering the Firing state. In particular the cell (i + n, k + n) 
simulates the former if (i, k — 1) participates to the synchronization for a row line, or simulates 
the latter, if (i — l,k) participates to the simulation for a column line (note that at least one 
of these conditions must hold). Then, there is a Square of ORings inch is a synchronization in 
time 3n — 1. ■ 

We can give now the main results of the section. 

Theorem 2 

• There is a synchronization of an ORing in time In and every synchronization of an ORing 
has time greater than or equal to 2n. 

• There is a synchronization of a Square of ORings in time 3n — 1 and every synchronization 
of a Square of ORings has time greater than or equal to 3n — 1. 

4 Signals 

The framework of a signal has been introduced in ^2] to simplify the design of a c-Line. This 
innovative definition provides a way to modularize the design of solutions. Informally speaking, 
a signal is a particular set of cells that at a given time receives/sends a word different from 
from/to the adjacent cells. In other words a signal describes the information flow in the space- 
time unrolling of a cellular automaton, allowing a modular description of the synchronization 
process, that is starting from basic signals we combine different signals to obtain new ones to 
describe in a more natural way the synchronizing algorithms. (Let us note that also in jHj and 
|16j the signals were used, anyway there the intended meaning was different). The scheme used 
to present some synchronization algorithms in time t > 2n — 1 for a c-Line of n processors is the 
following: some signals are designed and composed to obtain an overall signal that starts from 
the leftmost processor and comes back to it in exactly (t — 2n + 1) time units; then a minimal 
time synchronization starts, thus synchronizing the n processors in time t. 

We consider the time unrolling of a c-Line A a configuration C. Define the time i™ ax = 
m&x{t\(i,t) is active} and tf 111 = min{i|(i,t) is active}. Consider the set of all cells i such that 
there exists at least an active site (i, t) of (A, C), for such cells i the set of sites (i, tf m ) is called 
the rear of (A, C) and the set of sites (i,t™ ax ) is the front of (A,C). Moreover we say that 
(A,C) is tailed if there exists a subset of Q, called tail(A,C) such that for all i E {1, ... ,n}, 
state(i, t) € tail(^4, C) if and only if (i, t) belongs to the front of (A, C). The states in tail(vl, C) 
are called tail states. In words, a tail state appears for the first time (in the time unrolling of 
A) on the front of (A, C). 

Two active sites (ii, t±), fa, £2) are consecutive if ti =t\-\-\ and ii G {i\ — + 1}. 

A simple signal of (A, C) is a subset S of temporally consecutive sites with the property that 
if (A,C) is tailed, then (z,i™ ax ) belongs to S. The union of a finite number of simple signals 
of a given (A,C) is called signal of (A, C). A graphical representation of a simple signal S is 
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obtained by drawing a straight line between: 

(i) every pair of sites (i,t) G S and (i, t + 1) G S and 

(ii) every pair of sites (i,t) G S and (i + 1, i + 1) G 5 (resp. (i — l,t + 1) £ 5) if right(i,t) = 1 
(resp. Ze/£(z, £) = 1). 

A graphical representation of a signal is obtained by the graphical representation of its simple 
signals. The length of a signal S is (t max - t min + 1) where t max = max{t|(i,i) G S, 1 < z < n} 
and t min = min{t|(i,t) 6 S, 1 < i < n}. Sometimes, in the rest of the paper we refer to a signal 
without specifying an automata and a starting configuration. 

The following examples show two signals: Max and Mark. The former is the "fastest" signal 
(it touches one new cell each time unit), while the latter will be used to check the occurrence of 
an event (generally a signal crossing a given cell) thus if it is this case, triggering a new signal 
(see Figure 0}. 

Example 1 Let i ^ j and Max(z, j) be the set containing the sites (i + h, h + 1) if i < j, or 
sites (i — h, h + 1) otherwise, for < h < \i — j\ + 1. This set is a simple signal, with length 
\i — j\ + 1, of a tailed c-Line that starts from a configuration having the states of cells i and j 
different from all the others. 

Example 2 Given a positive constant k < n, the signal Mark(ti — k) is used to mark the cell 
n — k. The length of the signal Mark is n + k (see Figure [3J). It can be easily seen that Mark 
is a signal of a tailed c-Line. 



123456789 10 cells 123456789 10 




time 

Figure 4: The signals Max and Mark. 
4.1 Composition of Signals 

Signals can be composed in order to obtain new ones. Given two signals S\ and 52, we define 
the concatenation cat r (5i, S2) as the signal obtained by starting S\ at time 1 and 62 at time 
r + 1, that is S2 is delayed r time steps. More formally, cat r (Si, S2) = <Si U{(i,t + r)\(i, t) G S2}. 
In the concatenation of signals the following property is crucial. We say that a c-Line A2 on 
C<i can follow a tailed c-Line A\ on C\ if there exists a function h defined over tail(Ai, C\) and 
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such that h(p) = Cz(i) if p = state(i,t). When this property holds it is possible to switch from 
the front of (A^Ci) to C 2 . 

The following lemma recalls some sufficient conditions for the existence of a tailed c-Line for 
a signal cat r (Si, £2). 

Lemma 12 Let S\ and S2 be signals of the tailed c-Lines {A\ } Ci) and (^42>C2), respectively. 
The signal S = cat r (Si, S2) is a signal of a tailed c-Line (A, C\) if the following conditions hold: 

1. (^2,62) can follow (Ai,C\); 

2. if a site (i, t) belongs to the front of (Ai, C\) and (i, t') belongs to the rear of (A2, C2), then 
t<t' + r; 

3. if sites (i, 1) and (j, 1) belong to the rear of (A 2 , C 2 ) then i™ ax = t™ ax in (A x , Ci). 

Proof : Let (i, t) be a site of a c-Line such that t is the if 13 * in (A\, C\) and (i, 1) belongs to the 
rear of (A2, C2). Define s as r — t + 1. By the above property 3, this constant s is well defined, 
and by the above property 2, it is greater than 0. A tailed c-Line (A, C\) for S = cat r (5i, S2), 
can be obtained in the following way. At the beginning A behaves as A\. On the states from 
tail(^i, Ci), A counts up to s — 1 and then enter the corresponding state of C2. We recall 
that this step is well defined since s is a positive constant and the above property 1 holds. At 
this point A behaves as A2- Clearly, (A,C\) is tailed and S is a signal of (A, C\). Notice that 
if there are cells corresponding to active sites of {A21C2) which do not correspond to active 
sites of (A\, Ci), from the above properties we have that in both configurations C\ and C2 they 
correspond to quiescent states. ■ 

4.2 Non trivial signals 

We introduce here two non trivial signals of a c-Line that will be used to get the main syn- 
chronization solutions of the section. The first has a quadratic length and the second has an 
exponential length in the number of cells. In particular from Proposition ^ it is sufficient to 
consider only the case c = 1 (which is also the most difficult). For technical reasons in this 
section (and also in section we will number the first cell as cell number 1 (instead of as said 
in the preliminaries). 

The signal Quad. Given a positive constant k < n, QuAD(n — k) is a signal of a 1-Line A 
which is described as follows: 

• initially the cell 1 sends a bit 1 to the right; then if it receives a bit 1 from the right, it 
sends with a delay of one step (except for the first time, when there is no waiting), a bit 
1 back to the right; the cell 1 eventually halts when it receives two consecutive bits 1; 

• for 1 < h < (n — k), the cell h sends a bit 1 to the left when it receives for the first time a 
bit 1 from the left; then, if the cell h receives again a bit 1 from an adjacent cell, it sends 
a bit 1 to the other adjacent cell; 

• the cell (n — k) sends two consecutive bits 1 to the left when it receives a bit 1 from the 
left. 

Notice that the designed 1-Line A can be tailed as well: in fact the cells from 1 to (n — k) 
can enter a tail state when they receive two consecutive bits 1. The length of the Quad signal 
is (n — k) 2 — 1. 

Let us note now that for the implementation of this signal the cell (n — k) needs to be 
distinguished. In what follows we will use only QuAD(n — 2) in theorem 0] and Quad(?i — 1) 
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Figure 5: The signal cati(QUAD(n — 2), Mark(« - 2)). 

in theorem |21 thus we only need to distinguish cells (n — 2) and (n — 1): this can be done by 
Mark(h — 2) and Mark(w — 1), for n > 5. For smaller n much easier and ad hoc algorithms 
can be given (see Figure 0). 

The signal Exp. Given two positive constants k and d, we will define the signal Exp(n — k, d). 

An idle cell is a cell which never sends a bit 1 unless it receives a bit 1 from the left and in 
this case it sends two consecutive bits 1 to the left. 

Initially the only idle cell is the cell in — k). Exp(n — k,d) is a signal of a 1-Line which is 
described as follows: 

• first cell 1 sends a bit 1 to the right; then, whenever cell 1 receives a bit 1 from the right, 
it immediately replies sending back a bit 1; finally, if cell 1 receives two consecutive bits 1 
from the right, then it changes into an idle cell; 

• for 1 < h < (n — k), we distinguish two cases: 

— if the bit is received from the left then it alternates the following two behaviours: 

1. it sends a bit 1 back to the left; call these peak cells (though this is a property of 
the state entered by this cell.) 

2. it sends a bit 1 to the right; 

each peak cell starts counting from 1 to 2 i+1 — 2, for 1 < i < d. When 2 i+1 — 2 has 
been just counted, if the peak cell receives a bit 1 from the left at the next time unit, 
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then it is the i-th cell in the line and is marked (see below for an explanation). This 
way it can be distinguished later. 

— if a bit 1 is received from the right, then it sends a bit 1 to the left. If at the next 
time unit cell h receives another bit 1 from its right neighbour, then two other sub 
cases need to be considered: 

if h > d then the cell switches into an idle cell; 

else, for h < d, the cell sends two consecutive bits 1 to the left. (Note that when 
this case occurs, cells h < d have already been marked by step 2 above.) 

From the algorithm we have just described, a proof by induction on i < d can be given to 
show how a peak cell can be marked, in fact the following property holds: the length of the 
interval from the instant cell i is a peak cell for the first time and the instant it becomes a peak 
cell for the second time is 2 1 + S}=i — j) (see Figure H3 where d = 3, cell 2 is marked at time 
9 and cell 3 is marked at time 20). 



1 2 3 4 5 6 7 cells 1 2 3 4 5 6 7 cells 




time 

Figure 6: The signals cati(Exp(5, 3), Mark(5)) and cati(EXP(5, 1), Mark(5)) 

To implement a tailed 1-Line for Exp(n — k, d) initially the cell (n— k) must be distinguished. 
In what follows we will use the signals Exp(n — 2, •) and Exp(n — 1,1): the cells n — 2 and n— 1 
can be distinguished by using Mark(w — 2) and Mark(tj — 1), for n > 5. Observe also that 
the cells from 1 to (n — k) can enter a tail state after they received two consecutive bits 1. The 
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length of Exp(n — k, d) is 2 n k+1 — 2(n — k) — 2 d+1 + 2(d + 1) (see FigureEJ). In a very similar 
way we can define the signal E(n — k) of length 2 n ~ k+1 + 1 (see Figure |7J). 

1 2 3 4 5 cells 




time 

Figure 7: The signals E(n — k) 



5 Composition of synchronizations 

The design of synchronizations in times which are not minimal may not be obvious. A compo- 
sitional approach to achieve this task is thus very useful. In this section we discuss several ways 
to combine two or more synchronizations of the models of networks we consider. We start with 
a parallel composition, then we study a sequential and an iterated compositions. 

In the following, if Si is a synchronization of a c-CA then Gi, Li and Fi denote the General, 
Latent and Firing states of Si and Qi respectively, <5, denote respectively the set of states and 
the transition function. We use the cross product of automata as a mean to combine c-CA. 
Given a ci-Line A\ and a C2-Line A2, we denote as A± x A2 the (ci + C2)-Line defined as the 
standard cross product of A\ and A^. Notice that in the construction we keep distinct the 
communication links of the two lines and thus A\ x A2 allows to run in parallel synchronizations 
of a ci-Line along with synchronizations of a C2-Line. This construction is extended to all the 
other models we consider in an obvious way. We slightly modify the cross product construction 
to design a synchronization that selects among two different synchronizations according to a 
given condition P(n). Examples of such conditions are the parity of the number of processors 
and the fastest/slowest synchronization. We define a selecting c-Line in time t{n) as a c-Linc 
whose state set contains two disjoint subsets 0\ and O2, called the selection subsets, such that 
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starting from a standard configuration its configuration at any time t > tin) only contains either 
states from 0\ or states from O^. This definition is extended to all the other models we consider 
in an obvious way. The following lemma shows how to design a c-CA that selects between two 
given synchronizations according to a condition on the number of cells. Clearly by iterating this 
construction, a selection among more than two synchronizations can be obtained. 

Lemma 13 For i = 1,2, let Si be a synchronization on a Ci-CA in time U(n), and K be 
a selecting ck-CA in time t{n) < ti(n) with selection subsets 0\ and O2. Then there exists a 
synchronization on a c'-CA in time t'(n) such that d = ck+c\+C2, moreover if any configuration 
of K at time t > t{n) contains only states from 0\ then s{n) = t\(n), otherwise s{n) = t^ip). 

Proof : Let S be the c'-CA obtained by modifying K x S\ x S2 in the following way: for 

1 = 1,2 if a cell is entering Fi and the selecting automaton K is in a state from Oi then it 
enters the firing state of S. Clearly if S starts on a configuration which is composed of triples 
of corresponding states of the standard configurations for K, Si and S2, then S synchronizes in 
the claimed time. ■ 

As applications of the above lemma we show two examples. In the first example we face 
with the problem of obtaining a synchronization which synchronizes at the maximum or at the 
minimum time between two synchronizations. We first define a selecting CA performing the test 
ti{n) < t2(n), then we show that this selecting CA can be used to obtain a synchronization in 
either the maximum or the minimum time between two synchronizations. In the second example 
a particular behaviour is selected depending on the result of a comparison between the number 
of processors n and a constant h. 

Example 3 For i = 1,2 denote by Si a synchronization in time ti(n). We define a selecting 
CA K for the condition t\{n) < t2(n) in time t(n) = min{ti(n), t2(n)}. The CA K is mainly 
the cross product of Si and S2 with the modification that once a synchronization enters the firing 
state, K loops on this state. Thus we pick Oi = {Fi} and O2 = {^2}- Thus by Lemma U^ we 
have a synchronization in time ti{n), ifti(n) < t2{n), andt2{n), otherwise. Thus a synchroniza- 
tion in the minimum time between ti(n) and t2(n) is obtained. If we pick instead Oi = {F2} and 

02 = {-^i}, then a synchronization in the maximum time between ti(n) and t2(n) is obtained. 

Example 4 We describe a selecting CA K performing the test n < h, for a given positive 
integer h. Let Q = {G,L,pi, . . . ,ph, p<h,P>h} such that G and L are the General and Latent 
states respectively, and Oi = {p<h} and O2 = {p>h}- In the linear models the transition function 
can be informally described as follows. For the two-dimensional models K can be described in 
an analogous way. In the first step cells and 1 enter states pi and P2 respectively; next, each 
cell in the Latent state enters the state pi + i if its adjacent cell on the left is in a state pi for 
i < h, while it enters the state p>/j if this neighbour is in the state p^; if each cell is in a state pi 
for some i < h thus p<h is propagated up to cell (this takes just a step in a ORing and n — 1 
steps in a Line since this is the case if cell n — 1 is in a pi for i <h). When a processor enters 
the state p<h or the state p > h all the other processors are forced to enter the same state within 
a time n. Obviously, K is a selecting CA in time t(n) = n+min{h,n} . 

Note that the selecting CA from the Example can be used for any pair of synchronizations, 
as the time of the selecting CA is not larger than the time of any synchronization. 

In the next two lemmas we show how to compose two synchronizations in time ti{n) and 
t2{n) respectively, to obtain synchronizations in time ti{n) + t2(n) + d, for a given constant d, 
and in time ti{n)t2{n). 

Lemma 14 // 5i and S2 are two synchronizations on a c-CA respectively in time ti(n) and 
t2{n), then there exists a synchronization on a c-CA in time ti(n) + t2(n) + d for d > 0. 
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Proof : We define a synchronization S such that S behaves as Si from time 1 up to time 
ti(n), then at time ti(n) + 1 it switches to S^. Thus S is a synchronization in time t\{n) + t2(n). 
Furthermore, given a synchronization S' in time t(n) and with Firing state Fq, a synchronization 
in time t{n) + d can be obtained from S' by adding the states F[, . . . , F' d and the transition rules 
from F[ into FL-y for i = 0, ...,d — 1, and picking as the Firing state of the resulting 
synchronization. ■ 

Lemma 15 // 5i and S2 are two synchronizations on a c-CA respectively in time t\(n) and 
t2(n), then there exists a synchronization on a c-CA in time ti(n) • t2(n). 

Proof : We prove the above result for a 1-Line. The proof is similar ofr all the other models. 
We define a synchronization S consisting of an Iterative phase of length t\(n) which is executed 
t2(n) times. The set of states of S is Qi x Q2 x {0, l} 2 , the General state is (Gi, G2, 0, 1), 
the Latent state is (Li,L2,0, 0) and the Firing state is (iq, -F2, 0, 0). In the Iterative phase, the 
synchronization S modifies the first component of its state according to the transition functions of 
Si, until this component is F\. At the end of this phase S executes a transition step modifying the 
second component of the state according to the transition functions of S2. The bits sent according 
to transition function of S2 are saved in the last two components of each state according to the 
order left, right. Moreover, in this same step, S replaces F\ with either Gi or Li (depending on 
whether the cell is the one triggering in the initial configuration the firing signal of Si) in the 
first component. So the Iterative phase can start again, until the Firing state is entered by all 
the cells. So, the synchronization Si is iterated exactly t2(n) times and S takes time ti(n)t2(n). 
■ 

Finally we show a construction that allows to obtain synchronizations on a c-Square in time 
t(2n — 1) provided that there exists a synchronization of on a c-Line in time t{n). 

Lemma 16 Given a synchronization on a c-Line in time t(n), there exists a synchronization 
on a c-Square in time t{2n — 1). 

Proof : An (n x n) array can be seen as many lines of (2n — 1) cells, each of them having as 
endpoints cells (0,0) and (n — l,n — 1). Each of these lines corresponds to a "path" from cell 
(0,0) to cell (n — l,n — 1) going through exactly (2n — 3) other cells. Each cell of these 
paths has as left neighbour either cell (i — l,j) or cell — 1) and as right neighbour either 
cell (i + or cell + 1). 

Notice that a cell (i, j) is the (i + j — l)-th cell from the left in all the lines it belongs to. 
This property allows us to execute simultaneously on all these lines a synchronization in time 
t(n). Since the length of each line is (2n — 1), we have a synchronization of c-Square in time 
t(2n-l). ■ 



6 Two-way communication Networks 

In this section we compose the signals presented in the previous section to obtain solutions in 
time n 2 , 2 n , n [logn] and nf-^/n] on a 1-Line and on a 1-Square. Clearly these give as a corollary 
solutions in the same time for the c-Line and c-Square and for the circular c-Ring and c-Square 
of Rings. 

For technical reasons we start numbering cells from 1 (instead of 0). 
Theorem 3 There is a synchronization of a 1-Line in time n 2 . 

Proof : The solution is divided into two phases. The first phase consists of cati(MARK(n — 
1), Quad(ti — 1)) and has length (n — l) 2 as QuAD(n — 1) is delayed one time step, see FigureEl 
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By Lemma El this phase is a signal of a tailed 1-Line starting from a standard configuration. 
Hence cell 1 has entered a tailed state, say G' and considering this as the general state, a minimal 
time solution on a line is started, one step later: this is the second phase. Together the two 
phases give a solution to the FSSP in time n 2 . ■ 

Theorem 4 There is a synchronization of a 1-Square in time n 2 . 

Proof : The algorithm is the following: first a signal cati(MARK(n — 2), QuAD(n — 2)) is started 
on the first row, the length of this signal is (n — 2) 2 since QuAD(n — 2) is delayed one time step. 
This is a signal of a tailed 1-Line starting from a standard configuration (see Lemma 112(1 . Thus 
after (n — 2) 2 time units the cell (1, 1) enters a tail state, say G' . Considering G' as the General 
state, a minimal time synchronization on a linear array of n cells is executed on the first row 
and this takes other (2n — 2) time units. Once the Firing state F' is reached, we use F' as the 
General state of a minimal time synchronization that this time runs on each column, thus taking 
another (2n — 2) time units, which adds up to a total time of n 2 . ■ 

Theorem 5 There is a synchronization of a 1-Line in time 2 n . 

Proof : The solution is divided into two phases. The first phase consists of cati(MARK(n — 
1), Exp(n — 1, 1)) and has length 2 n — 2n + 2 see Figure |HI By Lemma IT21 this phase is a signal 
of a tailed 1-Line starting from a standard configuration. Hence cell 1 has entered a tailed state, 
say G' and considering this as the general state, a minimal time solution on a line is started: 
this is the second phase. Together the two phases give a solution to the FSSP in time 2 n . ■ 

Theorem 6 There is a synchronization of a 1- Square in time 2 n . 

Proof : First a signal cati(Exp(n — 2, 3), Mark(h — 2)) is started on the first row, see FigureEl 
After (2 n_1 — 2n — 3) time units the cell (1, 1) enters a tail state, say H. This is a signal of a 
tailed 1-Line starting from a standard configuration (see Lemma IT2]l. Now the cell (1, 1) enters 
a state G' and a minimal time synchronization on the first row is accomplished, using G' as the 
General state, thus taking other {2n — 1) time units. Once the Firing state F' is reached, each 
cell of the first row enters a state G" , and launches the signals Mark(h — 2) and Exp(ra — 2, 1) 
on each column, using G" as the General state. This takes another (2 n_1 — 2n + 5) time units, 
which sums up to time (2 n — 2n + 1). Finally, a minimal time synchronization on each column 
is accomplished, thus reaching time 2 n . ■ 
The proof of the existence of a synchronization of a 1-Line in time n[logn] and in time 
n[\/ra] is quite involved and long, see |12| . Here we recall the synchronization for the 1-Square. 

Theorem 7 There is a synchronization of a 1-Square in time ra[logn] and in time rafy'n]. 

Proof : The algorithms resemble those used to synchronize a line of n cells at the same 
times of |12j . Therefore here we only outline the main idea. For the synchronization in time 
n [log n] , we use a signal to synchronize the first row in time (n log n — 2n) and then we apply 
a synchronization to each column in time 2n (just a minimal time synchronization for a linear 
array with one more time unit). 

Let us informally describe the synchronization of the first row. Initially the cells numbered 
(1,5),(1, [n/2]), (1, \n/2\ + 1) and (l,n — 4) are marked: this can be easily accomplished in 
time 2n. This way the row can be seen as split in two halves and for each half a symmetric 
computation is done, therefore we will describe only the left half. A phase is iterated ( [log n \ — 5) 
times: each iteration starts at time ((i + l)n + 1), 1 < i < (log n — 5), and has length n. During 
the i-th. iteration, the test {i + 5) > [log n] , is performed in the following way: a signal of length 
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2(*+ 5 ) on the linear array consisting of the first (i + 5) cells and a signal Max of length n, which 
is composed of Max(1, [n/2]) and Max([Yi/2] , 1), are performed (see Figure |HJ). We compose 
the two signals to give Max a higher priority, thus if the exponential signal reaches a cell after 
the Max signal, it is aborted. In this case the Max signal finishes earlier than or at the same 
time as the exponential signal, and this means that (i + 5) > logn and thus this is the last 
iteration. Otherwise (that is Max finishes later) cell (i + 1) is marked and a new iteration starts 
(see Figure IHJ- Omitting minor details, at the end of the last iteration all cells are forced in tail 
states, so determining a standard configuration for a synchronization of a linear array of [n/2] 
cells in time n. The synchronization in time n[y^n] can be obtained in a very similar way by 
considering a quadratic signal, instead of an exponential one, to synchronize the first row in 
time (n^fn — 2n). ■ 
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Figure 8: The phase in the i-th iteration, i > 1 and n odd of the synchronization in time n [log n] 



Corollary 1 There are synchronizations of a c-Line, c- Square and a c-Ring, c > 1, in time n 2 , 
2 n , n[logn] and in time n\^/n~\. 

7 One- Way Communication Networks 

In this section we give synchronization algorithms for the circular networks, ORings and Square 
of ORings, in time n 2 , n[logn], n[\/n] and 2™. The algorithms in time 2 n is obtained by con- 
verting a solution for a CA. As in the previous section for technical reasons, we start numbering 
cells from 1 (instead of 0). 

The following theorem gives the solution in time n 2 . 

Theorem 8 There is a synchronization of a ORing and of Square of ORings in time n 2 . 

Proof : First consider the ORing. Assume n > 3, the case n < 3 can be dealt with a simple ad 
hoc strategy and we omit it (Xemma ll3l can be used with the test n > 3 to select the behaviour, 
see Example HJ). The algorithm is very intuitive, thus we will not give the details of the signals. 

The solution is divided into two phases: the Counting and the Synchronization phases. The 
Counting phase has length (n — 2)n + 1 and can be seen as constituted by n — 2 iterations of 
a sub-phase of n steps. This sub-phase is simply a Max signal going all along the ring, from 
the first cell to the last. In the first iteration the cell number 3 is marked with a marker M and 
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at each successive iteration M is moved one cell to the right, so M is moved to the first cell 
when n — 2 iterations have been executed, that is at time (n — 2)n + 1. This phase is a signal 
of a tailed 1-Line starting from a standard configuration (see Lemma I12|) . The Synchronization 
phase consists of a minimal time solution (in time 2n) on a ring and can start exactly at time 
(n — 2)n + 1 and the total solution has thus length n 2 . 

Now let us consider the Square of ORings. Here assume n > 5 and, as before, Lemma IT3*1 is 
used to select the behaviour. The solution in time re 2 is easily obtained through the following 
two steps: 

• the first row is synchronized in time 2n with a minimal time solution on a ring; 

• a solution in time re 2 — 2n is applied to each column. 

The solution in time n 2 — 2n is easily obtained from a solution in time n 2 on a ring and 
modifying the first iteration of the Counting phase in order to mark the cell 5 (instead of cell 3). 
In this way the Counting phase is constituted by n — 4 iterations of the sub-phase, thus saving 
2n steps. ■ 

Now we show how to obtain a solution in time nlogn. Let us recall that by Lemma|5]there is 
a two-end synchronization of a c-Line in time n. Notice that it is easy to modify this algorithm 
in such a way that in the (n — l)-th configuration the processor [re/2*] — 1, for a given i, is in 
a particular state which is different from all other states entered by the any processor (actually 
this is a signal of type Mark). A similar result can be easily obtained for the ORing as well. 

Lemma 17 There is a synchronization of a ORing in time 2n such that in the configuration 
2n — 1 the processor [re/2 1 ] — 1, for a given i > is in a particular state which is different from 
the state of any other processor. 

Now we can give the synchronization in time [re log re] . 

Theorem 9 There is a solution of an ORing and of a Square of ORings in time n [log n] . 

Proof : First consider the ORing. Let us assume for the moment n > 8. The solution is 
divided in three phases: the Initialization, the Iterative and the Synchronization phases. The 
Iterative phase is executed if re > 16, otherwise it is skipped. Informally speaking the whole 
solution is described as follows. 

In the Initialization phase the cell [re/16] — 1 is marked with a particular state, call it marker. 
Then the cell is marked if and only if re < 16. Using Lemma El this phase can be realized in 
time 2re. 

In the Iterative phase at the i-th iteration the marker is moved from the cell [re/2 l+3 ] — 1 
to the cell [n/2* +4 ] — 1 for i = 1, . . . , [log n] — 4 and again the cell is marked if re < 2 J+4 . The 
i-th iteration starts at time (i + l)re + 1 and ends at time (i + 2)n + 1. Note that the first step 
of the i-th. iteration coincides with the last step of the (i — l)-th iteration. Thus the total time 
taken by this phase is re([logn] — 4) + 1. The third phase is actually a minimal time solution. 
Thus, the total time is 2n + n([logre] — 4) + l + 2n — 1 = re [log re] . 

The case re < 8 can be easily solved with a particular strategy and the appropriate behaviour 
can be selected by using Lemma ITTfl 

Now let us consider the Square of ORings. Here assume n > 32 and, as before, the Lemma [TBI 
is used to choose the behaviour. The solution in time re [log re] is easily obtained through the 
following two steps: 

• the first row is synchronized in time 2n with a minimal time solution on an ORing; 

• a solution in time re [log re] — 2n is applied to each column. 
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The solution in time n [log n] — 2n is easily obtained from the solution in time n [log n\ on a 
Ring by modifying the Initialization phase in order to mark the cell [n/64] — 1 (instead of cell 
[n/16] — 1) thus saving 2n steps. ■ 



Theorem 10 There is a synchronization of an ORing and of a Square of ORings in time 2 n . 

Proof : A synchronization for an ORing in time 2 n_1 can be obtained from Theorem |S] by 
putting a General state in the second cell and then starting a synchronization on n — 1 cells. In 
an analogous way it is possible to obtain a solution in time 2 n ~ 2 . Using standard techniques as 
in Lemma ^ any computation of a Line A in time t(n) can be executed by a Ring B in time 
2t{n) — 1. In fact, assume that cell i + j — 1 of B at time 2j — 1 has the state that cell i of A 
has at time j. Now the cell i of A at step j needs the states of cells i — 1 and i + 1 at time j. 
Cell (i — 1) + (j — 1) of B at step 2j — 1 passes its own state p to the cell (i + (j — 1)) and this 
forwards p along with its state to the right neighbouring cell, the cell (i + 1) + (j — 1), that at 
step 2j can simulate cell i of A at step j. Now by this simulation and Theorems El and |H1 for the 
ORing, synchronization algorithms in time 2 n and 2 n , respectively, are achieved. Moreover, 
a synchronization of a Square of ORings in time 2 n can be obtained by first synchronizing the 
first row in time 2 n_1 and then all the columns, with the same algorithm as well. ■ 

8 Composed solutions 

In this section we briefly give some new synchronizations on a c-Square using known algorithms 
to synchronize a c-Line. Then, we show how to construct synchronizations in any time expressed 
by polynomials with nonnegative integer coefficients. 

In section we have given synchronizations for a c-Line in the following times: n 2 , 2 n , 
n[logn], and n[y / n]. Combining these results with the Lemma El we can give the following 
corollary. 

Corollary 2 Let K = 2n — 1, there are synchronizations on a c-Square in time K 2 , 2 K , 
K [log if], and K\VK]. 

The following lemma is crucial to obtain synchronizations in polynomial time. 

Lemma 18 Given a synchronization on a c-CA in time t{n) there exist synchronizations in 
time t{n) + n and n ■ t(n). 

Proof : From Lemma |S1 there exists a synchronization on a c-Line in time n, if the starting 
configuration has the General at both the endpoints. We have shown in section 13.31 that there 
exists a synchronization on a c-Square in time n if the starting configuration has the General 
at all the four corners. Clearly these synchronizations hold respectively on a c-Ring and on a 
c-Square of Rings. To obtain a synchronization in time n on a c-ORing, we split the ring in two 
halves and run the above synchronization on a c-Line in time n on both the halves at the same 
time. This thus requires to start from a configuration where the General is at cells 0, ^i— , n — 1, 
if n is odd and at cells 0, LnpJ> f^Tr"!) n ~ 1> otherwise. A synchronization in time n on a 
c-Square of ORings can be obtained running the above solution on all the rows at the same time 
and starting from a configuration where the General is for i = 0, . . . , n— 1 at cells (i, 0), (i, 
(i,n — 1), if n is odd and at cells (i,0), (i, L^^J), (i, p^])) {h n ~ !)> otherwise. Since on the 
various models it is possible to mark in time t(n) all the cells we need to enter the appropriate 
configuration for the above synchronizations in time n, we have that by Lemmas 1141 and 1151 the 
claimed synchronizations in time t{n) + n and n ■ t(n) can be constructed. ■ 
Thus we have the following theorem. 
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Theorem 11 Let h > 2 be an integer number and oq, . . . , a/j be natural numbers with ah > 1. 
There is a synchronization in time a^n* 1 + . . . + a\n + ao ori a c-Line, a c-Square, a c-Ring, a 
c-Square of Rings, an ORing, and a Square of ORings. 

Proof : From Corollary ^ Lemma 1181 and Theorem |H1 a synchronization in time n b can be 
obtained for every b > 2. By composing by Lemma El these synchronizations in time n b and 
the minimal time solutions given in sections 13.21 and 13.31 the theorem follows. ■ 

9 Conclusions 

We have presented various techniques to design solutions to the FSSP on different kinds of 
networks. The synchronizing time is given as input to the problem as is expressed as a function 
of the number of nodes. 

The approach of the paper has been that of defining a very formal and precise concept 
of signal and starting from basic signals, give operations to compose them to get other new 
solutions. We have introduced also as a parameter the capacity of the link measured in bits: 
this has allowed us to classify network models in terms of the overhead on the amount of traffic 
on the links. We believe that this approach can lead to the design of other signals for new 
solutions. 

Our study has not concerned the problem of the number of states of the solutions (that in 
the early papers concerning FSSP was of primary concerns). As a future direction of research 
this aspect has to play a primary role. Another kind of interesting, but unexplored, question 
is how to synchronize a c-line with teratologic neighbourhoods (for example (-3,-2,-1,0, 2)), this 
questions may have some connections with open questions of |20| ) . 
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